Risks of double-counting in deep sequencing.
نویسندگان
چکیده
Lou et al. (1) report a technique for increased sensitivity of DNA sequencing that they call “circle sequencing.” The authors’ method involves circularizing single-stranded DNA and performing rolling-circle amplification (RCA). Multiple duplicates are thereby generated, and errors can be removed by comparing the sequences of the duplicates. Lou et al. report accuracy similar to that of singlestranded tagging techniques (2, 3), but less than that achieved with double-stranded tagging (4). The authors propose their method as a high-efficiency tool for deep-sequencing of heterogeneous DNA samples. To obtain accurate deep-sequencing from a targeted region, it is necessary to ensure that each starting molecule is counted only once in the final sequence data. However, the method, as described, does not include steps to avoid multiple counting of the same starting molecule. Overcounting could occur for several reasons: (i) A large excess of random DNA primers is used. Thus, each template may be primed multiple times, thereby generating multiple extension products from each circular molecule. (ii) RCA results in long DNA products. The authors shear the DNA after the RCA step to an average length of three duplicates. Thus, the RCA reaction might generate 12 duplicates of a given molecule, for example, which is then sheared to generate four smaller molecules. All four of these molecules could inadvertently be scored as independent molecules. (iii) Y-shaped adapters are annealed to the amplified DNA. Y-adapters independently amplify each of the two strands, and could thereby overestimate the data yield by a factor of two. (iv) The assay uses double-stranded DNA. Each of the two initial DNA strands can be circularized and counted as unique molecules, potentially overestimating data yield by an additional factor of two. These problems can be overcome; because their starting DNA is randomly sheared before circularization, unique molecules can be identified by virtue of having unique circularization points. Erroneous scoring of duplicate molecules could thereby be avoided by filtering out consensus sequences that have matching shear points or transposed shear points (to account for molecules arising from the complementary strand). Although it is conceivable that these problems do not affect the dataset described in the paper by Lou et al. (1), because the number of input circles substantially exceeded the number of reads produced (1,000to 100,000-fold; table S2 in ref. 1), incorporation of shear-point filtering should be considered in future applications. Notably, for ultradeep-sequencing applications, such as detection of specific rare mutations among tumor cell populations, obtaining such molar excesses of a targeted genomic region is often not practical. There are a limited number of possible shear points flanking any given DNA sequence, and shear points do not occur randomly. Thus, the maximal depth that can be obtained by circle sequencing with shear-point filtering is limited. This limitation could be overcome by incorporation of a random tag sequence into the workflow to uniquely label each starting molecule before amplification. With this modification, the circle sequencing assay could theoretically be extended to allow for unlimited sequencing depth with no risk of overcounting.
منابع مشابه
Identification of microRNAs in corpus luteum of pregnancy in buffalo (Bubalus bubalis) by deep sequencing
This study was aimed to identify miRNAs of corpus luteum (CL) in buffaloes during pregnancy. For this study, CL (n=2) were collected from gravid uteri of buffalo and RNA was isolated. Following this, the purity and integrity of RNA was checked and used for deep sequencing using Illumina Hiseq 2500 platform. The reads’ quality was checked prior to in silico analyses viz. identification of conser...
متن کاملCounting Statistics of a Bose-Einstein Condensate in a Double-Well Potential
This thesis aims to investigate the dynamic properties of ultra-cold Bose-Einstein condensates (BEC) trapped in a deep double well potential. Despite its simplicity, this system shows a rich variety of dynamical behaviors: Josephson oscillations, self-trapping behavior and critical dynamics. In contrast to previous studies, we aim to understand not only the first moment of the evolving occupati...
متن کاملMonitoring the immune response to vaccination with an inactivated vaccine associated to bovine neonatal pancytopenia by deep sequencing transcriptome analysis in cattle
Bovine neonatal pancytopenia (BNP) is a new fatal, alloimmune/alloantibody mediated disease of new-born calves induced by ingestion of colostrum from cows, which had been vaccinated with a specific vaccine against the Bovine Virus Diarrhoea Virus (BVDV). The hypothesis of pathogenic MHC class I molecules in the vaccine had been put up, but no formal proof of specific causal MHC class I alleles ...
متن کاملDeep sequencing reveals double mutations in cis of MPL exon 10 in myeloproliferative neoplasms.
Somatic mutations of MPL exon 10, mainly involving a W515 substitution, have been described in JAK2 (V617F)-negative patients with essential thrombocythemia and primary myelofibrosis. We used direct sequencing and high-resolution melt analysis to identify mutations of MPL exon 10 in 570 patients with myeloproliferative neoplasms, and allele specific PCR and deep sequencing to further characteri...
متن کاملDouble Thyroglossal Duct Cyst Derived from a Single Tract: A Rare Presentation
Introduction: The advent of antibiotics and the improvement in dental hygiene have made the occurrence of deep neck infections less frequent than in the past. Nevertheless, the complications of these infections are often life threatening. The purpose of this study was to review the clinical findings in deep neck infections and identification predisposing factors of these complications. Material...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proceedings of the National Academy of Sciences of the United States of America
دوره 111 16 شماره
صفحات -
تاریخ انتشار 2014